Enhancement of text representations using related document titles

نویسندگان
چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extracting Lexico-semantic Relations from Document Titles Using Sublanguage Analysis

The sublanguage analysis methodology is based on the theory that texts generated by a group of people possess their own lexical, syntactic, and semantic characteristics. The main thrust of the research reported here is to apply the methodology to Korean document titles, reflecting the fact that there has been little research in applying the methodology for the Korean language although much rese...

متن کامل

Text Categorisation Using Document Profiling

This paper presents an extension of prior work by Michael D. Lee on psychologically plausible text categorisation. Our approach utilises Lee’s model as a pre-processing filter to generate a dense representation for a given text document (a document profile) and passes that on to an arbitrary standard propositional learning algorithm. Similarly to standard feature selection for text classificati...

متن کامل

Classifying Document Titles Based on Information Inference

We propose an intelligent document title classification agent based on a theory of information inference. The information is represented as vectorial spaces computed by a cognitively motivated model, namely Hyperspace Analogue to Language (HAL). A combination heuristic is used to combine a group of concepts into one single combination vector. Information inference can be performed on the HAL sp...

متن کامل

All-in Text: Learning Document, Label, and Word Representations Jointly

Conventional multi-label classification algorithms treat the target labels of the classification task as mere symbols that are void of an inherent semantics. However, in many cases textual descriptions of these labels are available or can be easily constructed from public document sources such as Wikipedia. In this paper, we investigate an approach for embedding documents and labels into a join...

متن کامل

A Novel Thresholding Method for Text Separation and Document Enhancement

Many thresholding-based image enhancement techniques have been developed and used for document analysis, where the simplicity and efficiency of thresholding makes it ideal to use for classifying layers within documents. However, the efficiency of these enhancement techniques can be impaired by the variation of grey levels in different documents, thus causing over-thresholding or under-threshold...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Information Processing & Management

سال: 1986

ISSN: 0306-4573

DOI: 10.1016/0306-4573(86)90073-7